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Abstract 

We  analyze  the  performance  of  receivers  which  use  envelope  detection  at  IF  to 
detect  optical  signals  with  orthogonal  modulation  formats.  We  obtain  exact  closed- 
form  expressions  for  the  error  probability  conditioned  on  the  normalized  envelope. 
The  only  information  necessary  for  obtaining  the  unconditional  error  probability  is 
the  set  of  tilted  moments  of  the  envelope.  We  then  provide  an  approximation  to  this 
envelope  which  is  not  only  accurate  to  the  first  order  in  phase  noise  strength,  but 
also  has  the  same  range  as  the  actual  random  envelope.  We  use  this  approximation 
to  obtain  the  bit  error  performance  of  the  three  receiver  models  that  we  consider. 
We  also  provide  a  tight  lower  bound  in  closed-form.  Finally  we  extend  the  analy¬ 
sis  to  N-ary  FSK  to  observe  the  improvement  due  to  increased  bandwidth  use  and 
transmitter/receiver  complexity. 
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1  Introduction 


Coherent  optical  communication  systems  provide  an  efficient  way  of  sharing  the  huge 
bandwidth  of  single  mode  optical  fiber  among  different  users  of  the  future  high-speed 
networks.  The  two  important  reasons  for  this  efiiciency  are  the  improved  receiver 
sensitivity  and  increased  channel  density  over  direct  detection  systems.  As  a  result, 
considerable  effort  has  gone  into  developing  coherent  systems  and  into  characterizing 
and  overcoming  problems  to  allow  coherent  technology  to  become  a  practical  alter¬ 
native  to  direct  detection  optical  systems.  Among  major  problems  associated  with 
coherent  technology  is  the  fact  that  the  phase  of  the  light  emitted  from  a  semicon¬ 
ductor  laser  exhibits  random  fluctuations  due  to  spontaneous  emissions  in  the  laser 
cavity  [1].  This  phenomenon  is  commonly  referred  to  as  “phase  noise”.  Phase  noise 
has  two  important  effects  on  the  performance  of  optical  networks.  The  first  of  these 
effects  is  the  broadening  of  the  spectrum  emitted  by  the  users  of  the  network,  which 
causes  interchannel  interference  and  thus  necessitates  wider  channel  spacing.  This  is 
especially  dominant  at  low  data  rates  where  channels  occupy  small  bandwidth.  The 
second  effect  is  the  incomplete  knowledge  and  the  time-varying  nature  of  the  phase 
which  makes  the  correct  retrieval  of  the  transmitted  data  bits  more  difficult  for  the 
receiver.  While  the  first  effect  results  in  inefficient  use  of  the  available  bandwidth, 
the  second  one  causes  a  degradation  of  the  bit  error  rate  of  the  transmitter/receiver 
pairs. 

It  is  the  second  effect  above  that  we  consider  in  this  paper.  This  problem  has 
received  much  attention  in  the  recent  years.  Various  performance  analyses  for  different 
modulation  formats  and  receiver  structures  exist  in  the  literature  [2,  3,  4,  5,  6,  7]. 
However,  since  researchers  use  different  sets  of  assumptions  and  approximations  in 
their  work,  it  is  difficult  to  reconcile  their  results  and  to  agree  on  the  quantitative 
effect  of  phase  noise  on  coherent  systems.  The  difficulty  seems  to  arise  from  the  fact 
that  the  Brownian  motion  model  of  the  phase  noise,  which  is  well-tested  and  agreed 
upon,  results  in  a  random  process  that  is  both  nonlinear  and  nonstationary.  Thus, 
as  one  moves  through  different  stages  of  a  nontrivial  receiver,  the  exact  statistical 
characterization  of  the  randomness  due  to  phase  noise  becomes  increasingly  difficult. 
One  needs  to  invoke  a  sequence  of  assumptions  and  approximations  to  overcome 
these  difficulties.  Naturally,  the  confidence  that  the  real  systems  will  operate  within 
a  reasonable  margin  of  the  predictions  of  the  analyses  depends  on  the  number  and 
the  nature  of  these  assumptions  and  approximations. 
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It  is  reasonable  to  expect  that  the  modulation  formats  which  are  the  least  to  suf¬ 
fer  from  phase  noise  will  be  those  with  asymmetric  constellations  in  the  signal  space, 
e.g.  on-off-keying  (OOK)  and  frequency-shift-keying  (FSK).  These  two  modulation 
formats  were  considered  recently  by  Foschini  and  his  colleagues  in  [3].  They  formu¬ 
lated  and  treated  the  problem  with  a  minimal  number  of  approximations.  Their  work 
provides  a  rigorous  framework  upon  which  the  researchers  in  the  field  may  agree  and 
improve  on.  As  a  result,  a  considerable  number  of  papers  have  resulted  from  their 
model  in  a  relatively  short  time  [7,  8,  4].  The  work  reported  here  also  conforms  to  the 
receiver  model  of  [3].  We  consider  orthogonal  modulation  formats  in  this  paper,  e.g. 
FSK  and  orthogonal  polarization  modulation.  However  the  results  can  be  general¬ 
ized  to  obtain  the  performance  of  OOK  as  well.  We  describe  the  receiver  models  and 
the  problem  in  next  section.  We  obtain  closed-form  expressions  for  the  probability 
of  error  conditional  on  the  squared  envelope  of  a  normalized  phase  noisy  signal,  for 
different  receiver  forms.  We  provide  an  approximation  which  is  close  to  the  actual 
envelope  and  whose  moments  are  readily  available.  We  compute  the  probability  den¬ 
sity  function  of  this  approximate  envelope  from  its  moments  using  a  method  based  on 
Legendre  polynomials.  We  then  use  this  density  function  to  remove  the  conditioning 
on  the  error  probability  and,  thus,  to  obtain  the  error  performance  of  the  systems 
under  consideration.  We  also  provide  a  lower  bound  to  the  error  probability  of  the 
double-filter  receiver  which  is  very  close  to  the  actual  error  probability.  Therefore  this 
lower  bound,  which  is  very  easy  to  compute,  can  be  used  to  estimate  the  performance. 

Finally,  we  extend  the  analysis  to  the  case  of  N-aij  FSK  where  we  provide  very 
tight  upper  and  lower  bounds  to  the  bit  error  probability. 


2  Problem  Description 

The  received  optical  signal  is  first  processed  by  an  optical  heterodyne  receiver  shown 
in  Figure  1  for  FSK  signals.  Polarization  modulation  would  require  a  polarization 
beam  splitter  and  a  pair  of  photodetectors.  Optical  heterodyning  transforms  the 
signal  from  optical  frequency  to  intermediate  frequency.  The  IF  signal  output  r{t)  is 
to  be  further  processed  by  the  IF  receiver.  This  IF  processing  is  the  main  focus  of 
this  paper. 

The  IF  receiver  structure  that  we  consider  in  this  paper  is  shown  in  Figure  2  where 
the  incoming  signal  is  corrupted  by  phase  noise  and  additive  noise.  The  front  end 
of  this  receiver  is  the  standard  quadrature  receiver  that  performs  envelope  detection 
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of  FSK  signals.  The  correlators  and  integrators  perform  a  bandpass  filtering  to  limit 
the  noise  power  corrupting  the  signal.  Since  the  integrators  integrate  their  inputs 
over  a  duration  T'  the  effective  bandwidth  of  the  filter  is  1/T'.  For  a  uniformly 
distributed  phase  uncertainty  which  is  constant  over  the  bit  duration  the  optimum 
value  of  T'  is  the  bit  duration  T.  However,  when  the  signal  is  corrupted  by  phase 
noise,  the  spectrum  of  the  signal  is  broadened.  Therefore  a  wider  filter  bandwidth, 
equivalently  smaller  integration  times,  may  be  necessary.  For  analytical  convenience, 
we  only  consider  the  case  where  the  ratio  T/T'  is  a  positive  integer  M  as  in  [3]. 

The  outputs  of  the  in-phase  and  quadrature  branch  integrators  in  Figure  2  are 
squared  and  then  added  to  perform  the  envelope  detection.  The  remaining  processing 
depends  on  the  value  of  M.-  For  M  =  1,  the  adder  outputs  are  sampled  at  the  end 
of  the  bit  duration  and  the  two  values  are  compared  to  reach  a  decision  about  the 
transmitted  data  bit.  We  call  the  receiver  with  M  =  1  a  single  sample  receiver  since 
only  one  sample  is  taken  during  a  bit  duration.  For  M  >  2,  the  adder  outputs  are 
sampled  every  T'  seconds  resulting  in  M  samples  per  bit.  The  way  these  samples 
are  processed  depends  on  the  complexity  that  one  desires.  One  simple  strategy  is  to 
discard  all  but  the  last  one  of  these  samples  and  to  use  the  last  sample  as  in  the  single 
sample  receiver.  A  more  efficient  way  is  to  average  these  M  samples  and  compare  the 
two  averages.  This  averaging  is  not  the  optimal 'processing  of  the  samples.  However 
since  it  can  be  performed  by  a  lowpass  filter  it  is  a  practical  reception  strategy  to 
implement. 

We  classify  the  reception  strategies  above  into  three  categories.  The  first  is  the 
single  sample  receiver,  the  second  one  is  the  multisample  receiver  with  single  fil¬ 
tering  and  the  third  one  is  the  multisample  receiver  with  double  filtering.  These 
receiver  structures  were  first  suggested  by  Kazovsky  et.  al.  in  [10]  for  ASK  mul¬ 
tiport  homodyne  receivers.  In  the  framework  of  [10],  the  single  sample  receiver  is 
the  conventional  matched-filter  receiver,  the  single-filter  receiver  is  the  conventional 
receiver  with  a  widened  filter,  and  the  double  filter  receiver  is  the  wide-band  filter- 
rectifier-narrowband  filter  (WIRNA)  structure.  It  was  also  noticed  in  [10]jthat  if  the 
M  samples  of  the  double  filter  receiver  are  viewed  as  coming  from  M  distinct  chan¬ 
nels  (or  from  a  fast  frequency-hopped  spread  spectrum  system),  then  the  receiver  is 
functionally  equivalent  to  the  noncoherent  receiver  for  multichannel  signaling  treated 
in  [11].  Some  of  our  results,  in  particular  the  conditional  error  probability  given  by 
Equation  (19),  may  be  obtained  by  using  the  results  of  [11].  However,  we  wiU  proceed 
independently  for  the  sake  of  completeness. 
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2.1  Single  Sample  Receiver 

We  first  consider  the  single  sample  receiver.  This  receiver  is  the  optimal  receiver  when 
the  phase  of  the  received  signal  has  uniform  distribution  and  is  time-invariant.  The 
error  probability  for  this  case  has  a  simple  closed- form  (1/2  [12].  We  will 

see  that  a  similar  (but  more  complicated)  result  can  be  obtained  in  the  phase  noisy 
case. 

We  assume  that  the  received  IF  signal  is  the  FSK  signal  corrupted  by  additive 
noise  and  phase  noise: 

r{t)  =  Acos(27rfit  -f  ^(t))  4-  n(t)  i  =  0, 1  (1) 


where  /o  and  /i  are  the  frequencies  for  “0”  and  “1”  respectively.  The  phase  noise 
0{t)  is  a  Brownian  motion  process  described  by 


9{t)  =  2Tr  f  ^t(r)dr  (2) 

Jo 

with  /x{t)  being  a  zero- mean  white  Gaussian  process  with  two-sided  spectral  density 
/?/27r.  Then  0{t)  is  a  zero-mean  Gaussian  process  with  zero-mean  and  variance  2ir/3t’, 
is  the  combined  linewidth  of  the  source  and  local  oscillator  lasers.  The  additive 
noise  7i(t)  in  (1)  is  a  white  Gaussian  process  with  two-sided  spectral  density  iVo/2.  We 
assume  that  the  difference  between  fi  and  /o  is  much  larger  than  the  filter  bandwidth 
1/T',  so  that  when  a  “0”  is  transmitted,  the  integrator  outputs  of  the  lower  branch 
integrators  in  Figure  2a  contain  no  signal  component,  and  vice  versa.  With  this 
assumption,  when  a  “0”  is  transmitted  the  sampler  outputs  at  time  T  can  be  written 
as 


Fo  = 


Fi  = 


—  /  dt  +  rico  +  jn,o 
2  Jo 

[nci  +  jn^i  f 


(3) 


where  nd  and  are  statistically  independent  and  identically  distributed  (i.i.d.) 
Gaussian  random  variables  with  zero-mean  and  variance  cr^  =  NoTfA.  Due  to  sym¬ 
metry,  the  error  probability  is  given  by 


n-Pr(Fo<Fi) 


(4) 


We  will  now  condition  the  error  probabihty  on  the  amplitude  of  the  phase  noise 
integral  in  (3).  We  define 


T 


dt 


(5) 
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Then  the  conditional  density  of  lo  can  be  easily  found  as^ 

p(So|y)  =  (6) 

while  Yi  has  an  exponential  density  given  by 

P(!/1)  =  ■  (7) 

Therefore  the  conditional  error  probability  comes  as  a  result  of  the  classical  problem 
of  incoherent  detection  of  orthogonal  signals  as  [12] 

P.IY)  =  .  (8) 

Let’s  now  express  Y  in  a  different  form.  From  (5),  we  have 

r  =  4?  I 

2  Mo 

Since  $[Tu)  has  variance  2iTf3Tu,  Y  can  be  written  in  terms  of  standard  Brownian 
motion  ^(t)  (i.e.  that  with  variance  t)  as 

Y  =~\f^  dt 

2  Mo 

where  7  =  2'k^T.  Now  if  we  define  the  random  variable  ^(7)  as 

X(7)  =  I  dt  '  (9) 

170 

(8)  can  be  written  in  terms  of  X  as 

P,{X)  =  (10) 

Zi 

where  ^  =  A^Tf2No  is  the  signal-to-noise  ratio.  Therefore  Pg  is  given  by 

Pe  =  Ex  •  (11) 

In  the  case  with  no  phase  noise  (7  =  0),  this  reduces  to  l/2e“^/^  as  it  should. 

An  immediate  implication  of  (11)  is  that  there  does  not  exist  a  bit-error-rate- 
floor,  i.e.  a  nonzero  error  probability  that  exists  even  when  ^  tends  to  00,  for  the 

^For  notational  convenience  we  indirectly  address  the  random  variable  that  a  density  function 
refers  to  with  its  argument,  i.e.,  p(y)  stands  for  pr(j/)  in  the  more  standard  notation. 
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single  sample  receiver,  provided  that  the  probability  density  function  of  does 

not  contain  a  delta  function  at  the  origin 

To  determine  Pg  from  (11)  we  have  to  obtain  either  the  probability  density  function 
or  the  moment  generating  function  of  X.  Therefore  the  random  variable  X  plays  a 
key  role  in  the  performance  analysis  of  the  single  sample  receiver.  In  the  next  section, 
we  will  show  that  this  is  also  true  for  the  multisample  receivers  that  we  consider. 

2.2  Multisample  Receivers 

In  the  previous  section,  we  mentioned  that  the  receiver  may  use  a  shorter  integration 
time  T'  than  the  full  bit  duration  T.  The  advantage  of  this  scheme  is  that  the  phase 
is  not  allowed  to  wander  too  much  so  that  the  amount  of  collected  signal  energy  may 
not  drop  below  undesirable  levels.  Equivalently,  the  bandwidth  of  the  IF  filter  is 
increased  to  pass  more  of  the  signal  power.  If  only  the  last  of  the  M  samples,  the 
one  at  t  =  T,  is  used,  then  the  received  signal  is  effectively  processed  only  during 
the  interval  {T  —  T',T),  hence  there  is  a  drop  in  the  received  signal-to-noise  ratio 
by  a  factor  of  1/M.  However,  since  the  phase  is  allowed  to  change  in  an  interval 
of  length  T',  the  effective  value  of  7  also  drops  by  a  factor  of  1/M.  (This  can  be 
observed  by  repeating  the  calculations  of  the  previous  section  when  T  is  replaced  by 
T'.)  Therefore  the  probability  of  error  for  the  multisample  receiver  with  single  filter 
is  given  by 

P,  =  Ex  .  (12) 

.  2 

An  inspection  of  (12)  shows  that  when  the  signal-to-noise  ratio  and/or  the  phase 
noise  strength  is  small,  the  optimal  value  of  M  will  be  small.  However,  for  large 
signal-to-noise  ratios  and  large  phase  noise  values,  it  may  be  worthwhile  to  sacrifice 
from  the  signal  energy  while  alleviating  the  effect  of  phase  noise. 

The  single- filter  receiver  uses  only  one  of  the  M  available  samples.  Thus  the 
single-filter  receiver  is,  in  fact,  a  single  sample  receiver  with  widened  front  end  filter 
bandwidth.  The  efficiency  of  this  receiver  would  be  improved  if  all  the  M  samples 
were  used  in  the  decision  process.  A  simple  processing  of  these  samples  is  averaging, 
which  may  be  accomplished  by  low-pass-filtering  the  sum  of  the  outputs  of  the  square- 
law  devices  [3].  The  result  is  the  multisample  double-filter  receiver  that  we  analyze 
below.  Note  that  this  receiver  is  not  the  (still  unknown)  receiver  which  minimizes  the 

®Note  that  the  inherent  error  floor  in  FSK  due  to  the  finite  frequency  separation  [13,  5]  is  neglected 
in  our  model. 
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probability  of  error.  It  is  closely  related  to  the  optimal  receiver  for  the  case  where 
the  phase  noise  process  9{t)  is  piecewise  constant  and  takes  M  independent  uniformly 
distributed  values  during  a  bit  interval.  In  this  case,  the  optimal  receiver  averages 
the  square  root  of  the  samples  when  the  signal-to- noise  ratio  is  high. 

In  the  case  of  a  “0”  being  transmitted,  the  lowpass  filter  outputs  will  be 

M 

>0  =  X!  I®(^)  ■+■  ‘^co{k)  +  jn^o{k)f  (13) 

k=l 

M 

=  1]  hci(A:)  (14) 

fc=i 

where  x{k)  is  given  by 

4  rfer' 

^(k)  =  TT  I 

and  all  the  additive  noise  samples  are  i.i.d.  zero-mean  Gaussian  with  variance 

a'  =  NoT'/A  . 


Since  0{t)  has  independent  increments,  x{k)  are  statistically  independent.  Further¬ 
more,  since  9{t)  has  stationary  increments,  |.'c(A:)|  will  be  identically  distributed'*. 
Then  each  of  the  terms  in  (13)  and  (14)  obey  the  distributions  in  (6)  and  (7)  respec¬ 
tively.  [Y  in  (6)  is  replaced  by  |a;(A:)|.)  In  particular,  since  each  term  in  (14)  has 
i.i.d.  exponential  distribution,  li  has  a  Gamma  distribution  with  parameters  M  and 
l/2a^.  It  is  convenient  to  normalize  To  and  Yi  via 


Then  hi  has  the  density  function 


p(vi)  = 


-e-"‘  m  >  0 


(M-  1)! 

Due  to  symmetry,  the  probability  of  error  is  given  by 


(15) 


Pr(Fo  <  Fi) 


L 


k=0 


ki 


e  '"‘p{vo)  dvo 


(16) 


“^This  also  implies  that  the  envelopes  preserve  their  statistical  properties  from  one  bit  to  another. 
So  we  can  consider  the  first  bit  in  the  performance  analysis  without  loss  of  generality. 
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where  p(fo)  is  the  density  function  of  Vq.  This  density  appears  to  be  difficult  to  obtain 
in  closed-form.  Therefore,  we  will  first  condition  (16)  on  the  phase  noisy  envelopes. 
Given  {x{k)  :  1  <  k  <  M},  Vq  is  the  sum  of  squares  of  2M  independent  Gaussian 
random  variables,  the  density  of  which  is  given  by  [12] 

p{vo\{x{k)})  =  \—j  (Vdrvo)  vo  >  0  (17) 

where  /m-i(-)  is  the  modified  Bessel  function  of  order  M  —  1  and  r  is  defined  as 

I  Af 

Note  that  the  conditional  density  of  Vq  depends  on  {a:(A:)}  only  through  the  sum  of 
their  squared  amplitudes.  Tliis  enables  us  to  proceed  further  and  to  express  the  error 
probability  conditioned  on  r  as 


Pe(r)  =  VT{Vo<V,\r) 


^  ^  1  /•oo  /  7*  \  (AT — 1)/2  . 

=  E  h.  L  (?)  ■ 


(18) 


It  is  shown  in  Appendix  A  that  Pe(r)  can  be  expressed  as 

j-,/2  /*  +  -V  -  A  1 


P.(r) 


E 


E 


OM  ^ 

^  n=0  fc=n 


(19) 


k  ~n  /  2** 

Now  we  relate  r  to  the  envelope  of  the  standard  Brownian  motion  using  the 
techniques  of  the  previous  section.  From  the  definition  of  r  and  x{k)  we  have 


hi 


kT/M 


I  . 


8a2  \m 


E 

k~l 


Jk-i 


where  ip{t)  is  the  standard  Brownian  motion.  Defining 


^kil) 


Jk-i 


and  using  the  definitions  of  the  additive  noise  variance  cc^  and  the  signal-to-noise  ratio 
(SNR)  we  obtain 


1 


M 


(20) 


k=l 
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Therefore  r  is  the  SNR  weighted  by  the  sample  average  of  the  M  normalized  phase 
noisy  envelope  squares,  it  can  be  viewed  as  a  phase  noisy  signal-to-noise  ratio.  As 
M  oo,  the  sample  average  tends  to  the  mean  of  Xk  with  probability  1,  by  the  law 
of  large  numbers.  Also,  the  mean  of  Xk{'ylM)  tends  to  1  as  ilf  — >  oo.  Therefore 
r  tends  to  which  is  the  maximum  value  it  can  assume  since  Xk  <  1  by  Schwartz 
inequality. 

The  conditional  error  probability  given  by  (19)  is  shown  in  Figure  3  as  a  function 
of  r.  As  expected  Pe{r)  decreases  as  r  gets  larger  while  M  is  fixed.  However,  for 
a  fixed  value  of  r,  Pe{r)  increases  with  M.  Then,  there  is  a  tradeoff  between  the 
collected  energy  of  the  phase  noisy  signal  and  the  additive  noise  energy  corrupting 
the  signal  as  M  changes.  There  must  exist  an  optimum  value  of  M  which  balances 
the  two  conflicting  noise  effects.  This  behavior  is  also  observed  in  [3]. 

We  can  readily  obtain  a  lower  bound  to  the  error  probability  without  calculating 
the  density  of  ^(7).  We  show  in  Appendix  B  that  Pe{r)  as  given  by  (19)  is  a  convex  U 
function  of  r  for  any  value  of  M.  The  unconditional  error  probability  is  the  expected 
value  of  Pe{ry,  thus  replacing  r  by  its  mean  results  in  a  lower  bound  by  Jensen’s 
inequality  [14].  Using  (20),  one  has 

Pe>Pe(F?[X(7/M)])  .  (21) 

The  expected  value  of  A’'(7)  will  be  found  in  the  next  section.  Figure  4  shows  the 
lower  bound  in  (21)  with  the  use  of  (34)  of  the  next  section.  At  each  value  of  SNR  ^ 
and  the  phase  noise  strength  7,  the  lower  bound  has  been  minimized  with  respect  to 
M. 

The  overall  error  probability  Pe  is  obtained  from  Pe{r)  by  removing  the  condition¬ 
ing  over  r.  If  Pa{r)  denotes  the  probability  density  function  of  r,  then  we  have 

Pe  =  y  Pc{r)pair)  dr  .  (22) 

Since  r  is  a  weighted  average  of  i.i.d.  random  variables  Xk  as  given  by  (20),  the  density 
of  r  is  given  in  terms  of  the  density  of  Xk  by  an  M-fold  convolution.  Then,  we  see 
that  the  density  of  the  phase  noisy  squared-envelope  is  again  the  only  information  we 
need  to  obtain  the  error  probability.  The  convolutions  must  be  performed  for  each  M 
since  the  individual  densities  to  be  convolved  correspond  to  the  envelope  with  phase 
noise  strength  7/M.  Therefore  the  computation  becomes  very  costly  with  many 
convolutions  to  be  performed.  We  describe  an  alternative  method  which  eliminates 
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the  need  for  convolutions.  From  (19),  the  error  probability  can  be  written  as 


'M-l 

p,  =  E, 

.  n=0 


-r/2 


(23) 


where  the  coefficients  On  are  defined  as 


a„.  = 


2^n! 


JW-l 

E 

fe=n 


'k  +  M  - 
k  —  n 


1 

¥ 


(24) 


Using  the  sample  average  expression  for  r  given  in  (20)  one  gets 


M-l 


n=0 


'X' 

2M , 


E 


/  M  \ 

(  s  ^  {.l/M) 

\fe=l  /  \  fc=l  ) 


Finally  expanding  the  expression  above  in  a  multinomial  sum  and  using  the  indepen¬ 
dence  of  the  X\f  we  obtain 


Af-l 


n=0 


2M 


E 


71 : 


M 


fel  H - 


fei!  •  •  •  kml  Si 


n 


where 


a{k)  =  E 


A’’*’(7/M)exp 


2M 


X{ilM) 


(25) 


(26) 


The  need  for  convolutions  is  eliminated  by  this  expansion.  All  that  is  necessary 
about  the  envelope  is  a  set  of  tilted  moments  a{k).  This  reduces  the  computations 
considerably  as  will  be  seen  in  Section  5.  Further  computational  savings  can  be 
obtained  by  observing  that  the  inner  summation  in  (25)  contains  many  identical 
terms:  any  permutation  of  an  M-tuple  (^i, ^2) •  •  •  ? ^m)  results  in  the  same  term. 
Therefore  the  summation  may  be  limited  to  ordered  tuples  with  ki  >  k2  >  •  •  •  >  k^ 
with  the  introduction  of  a  scaling  factor  that  counts  the  number  of  permutations. 
This  scaling  factor  depends  on  how  distinct  the  entries  of  the  tuple  are.  If  all  the 
entries  in  a  tuple  k  are  distinct,  then  k  has  M\  permutations.  If  an  entry  ki  is  repeated 
r{ki)  times  in  k,  then  every  distinct  permutation  has  r(fc;)!  copies.  Then,  the  scaling 
factor  with  ordered  indices  is 

N{k)  = 


M\ 


Y\ir{ki)\ 

where  the  product  is  taken  over  i  for  which  ki  are  distinct. 
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3  Phase  Noisy  Envelope  and  An  Approximation 


In  the  previous  sections,  we  have  seen  that  the  squared-envelope  of  a  phase  noisy- 
sinusoidal  plays  a  critical  role  in  the  perfornaance  analysis  of  the  incoherent  IF  re¬ 
ception  of  heterodyne  FSK  system.  This  squared-envelope  is  the  random  variable  X 
defined  in  (9)  which  we  repeat  here  for  convenience: 

(27) 

where  we  have  removed  the  explicit  argument  7  for  notational  simplicity.  Due  to  its 
importance  in  the  performance  analysis,  the  determination  of  the  probability  den¬ 
sity  of  X,  or  equivalently  its  moment  generating  function,  has  received  considerable 
attention.  Foschini  and  his  coworkers  provided  the  first  comprehensive  treatment  of 
this  problem  in  [3]  and  [9].  We  summarize  their  approach  below,  since  it  is  directly 
related  to  our  subsequent  analysis. 

Foschini  et.  al.  expand  the  complex  exponential  into  its  power  series  and  retain 
the  first  order  7  terms.  The  resulting  approximation,  Xi,  is  linear  in  7  and  is  given 

by 


(We  use  the  subscript  L  to  emphasize  the  linear  nature  of  this  approximation.)  Fos¬ 
chini  et.  al.  expand  -0(4)  in  Fourier  cosine  series  on  (0, 1)  and  obtain 

Xl  =  1  —  'yz^Dz  (29) 


X  =  f 
Jo 


where  z  is  an  infinite  dimensional  vector  of  independent,  identically  distributed  Gaus¬ 
sian  random  variables  with  zero-mean  and  unit  variance,  and  D  is  an  infinite  dimen¬ 
sional  diagonal  matrix  with  da  =  l/(i7r)^.  Thus,  z^Dz  is  a  compact  notation  for 
zf  f{i7ry.  The  moment  generating  function  of  Xi  is  then  found  as  [9] 


MxM  =  E  (e'^7 


\/2'ys 
sinh  \/2'ys 


(30) 


and  is  inverse  Laplace-transformed  to  obtain  the  density  function  of  X  in  semi-closed 
form. 

Equation  (30)  can  be  used  in  conjunction  with  (11)  to  obtain  the  error  probability 
for  the  single  sample  receiver  with  the  linear  approximation.  The  result  is  obtained 
as 


Pe  = 


|e  sin  '  ^7  < 

00  ^7  > 


(31) 
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where  the  first  case  follows  via  sin.h(ja:)  =  jsiii(a:),  while  the  second  case  follows 
from  the  derivation  of  (30)  in  [9].  It  is  seen  that  this  approximation  fails  to  give 
sensible  results  at  high  SNR  values.  This  is  due  to  the  fact  that  although  A’’  can 
take  values  only  between  0  and  1,  Xr,  can  become  negative.  In  fact,  the  density  of 
Xi  has  a  nonzero  tail  for  negative  arguments,  which  becomes  increasingly  dominant 
as  7  increases.  According  to  (31),  this  negative  tail  of  the  density  function  pxi,{x) 
decays  exponentially,  as  exp{Tr^x/2'y)  for  x  <  0,  so  that  when  (  >  7r^/7  the  integral  of 
g-^»/2p^^(j.)  diverges.  This  problem  is  not  observed  in  the  numerical  results  of  [3]  due 
to  the  truncation  of  the  negative  tail  of  the  probability  density  function.  However,  the 
truncation  weakens  the  analytical  basis  of  the  approximation:  the  truncated  density 
function  is  no  longer  a  first  order  approximation  to  that  of  the  actual  envelope,  this 
introduces  an  uncertainty  on  the  error  probability  results. 

We  seek  a  better  approximation  to  the  random  variable  X  by  forcing  the  range 
of  the  approximate  random  variable,  say  A,  to  match  the  range  of  X,  namely  (0,1). 
Therefore  we  require  a  good  approximation  to  satisfy 

1.  0  <  A(7)  <  1  for  aU  7  >  0  and  all  Brownian  sample  paths 

2.  X{'y)  and  A’^(7)  match  to  the  first  order  in  7  for  all 

The  range  constraint  above  suggests  that  A(7)  can  be  written  as 

A(7)  =  exp  [-(?(v/7^)] 

where  (7  is  a  functional  whose  domain  is  the  set  of  functions  defined  on  [0, 1]  which 
vanish  at  0,  and  whose  range  is  [0, 00).  We  force  G  to  satisfy 

1.  G{f)  >  0  for  aU  f{t),  with  equality  if  and  only  if  /(f)  =  0 

2.  (?(/)  =  G{-f)  for  aU  /(f), 

where  the  second  property  is  due  to  the  fact  that  the  two  sample  pathp  ■0(f)  and 
—0(f)  result  in  the  same  value  of  X{‘j).  For  A(7)  and  A(7)  to  be  identical  to  the 
first  order  in  7,  A(0)  =  A(0)  and  A'(0)  =  A'(0)  must  be  satisfied,  where  ‘prime’ 
denotes  the  first  derivative  with  respect  to  7.  We  state  one  functional  G{f)  that 
satisfies  these  requirements  in  the  following  lemma. 

Lemma  1:  If  G{f)  =  fg  P  dt  —  (/□  /  dt^  ,  then  A'(7)  and  A’’(7)  are  identical  to 
the  first  order  in  7. 
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Proof  :  Since  A’’(0)  =  1,  G(0)  =  0  is  needed  for  A’(O)  =  A’(O).  On  the  other 
hand,  it  is  straightforward,  but  somewhat  lengthy,  to  show  that 

X'(0)  =  dt  . 

For  a  G  that  satisfies  G(0)  =  0, 


A  (0)  -  — • 

It  is  easily  seen  that  the  functional  given  in  the  statement  of  the  lemma  satisfies  both 
of  the  requirements.  □  _ 

Note  that  the  functional  of  the  Lemma  1  satisfies  the  positivity  and  evenness 
conditions  as  well.  Therefore  the  random  variable  Xe  defined  as 


=  exp(— 7z^Z?z) 


(32) 


with  z  and  D  as  defined  previously,  promises  to  be  a  good  approximation  to  X.  We 
use  the  subscript  E  to  denote  the  exponential  nattire  of  this  approximation. 

To  summarize,  we  have  obtained  an  approximation  that  retains  the  desirable 
feature  of  that  of  Foschini  et.  al.  while  having  the  additional  feature  that  it  takes 
values  that  are  in  the  same  range  as  the  original  random  variable. 

A  nice  feature  of  the  random  variable  Xs  is  that  its  moments  are  easily  obtained 
from  (30)  as 


m  =  E(x%) 


sinh(A/27t) 


(33) 


for  all  real  t  >  — 7r^/27.  For  t  <  — 7r^/27,  the  t’th  moment  of  Xe  is  infinite. 

An  indication  of  how  well  the  two  approximations  model  X  is  the  behavior  of  the 
first  two  moments  as  7  varies.  The  moments  of  Xe  are  already  found  in  (33).  We 
now  evaluate  the  first  two  moments  of  X.  Rewriting  (27)  as  a  double  integral  and 
taking  the  expectation  we  obtain 


E{X)  =  r  /'E{exp[iv^(V>(t)- V’(^))]}  dtds  . 

Jo  Jo 

Now  we  use  the  fact  that  y^('0(<)  —  i'is))  is  Gaussian  with  zero-mean  and  variance 
7  |i  —  s|  and  get 


E{X) 


j  j  exp  {j  \t  ~  s\ /2)  dt  ds 


4 

7 


7 


(34) 
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The  calculation  of  E  [X^)  follows  along  the  same  lines:  we  first  express  as  a 
four-fold  integral,  take  the  expectation  and  express  the  integrand  as  a  Gaussian  char¬ 
acteristic  function  while  observing  the  dependencies  of  the  random  variables.  We 
finally  obtain 


2--  +  — 
72  7  272 


\37  ^  97V 


-1- 


(35) 


The  first  two  moments  of  X  as  given  by  Equations  (34)  and  (35)  are  in  agreement 
with  previous  calculations  of  the  mean  and  the  variance  in  [10]  and  [13]. 

Finally  we  calculate  the  first  two  moments  of  Xl  (without  truncation).  The  first 
moment  is  obtained  as 


E{Xl)  =  1-7£?(z^T»z) 

00  1 

=  1  ^ 


=  1 

and  the  second  moment  is  obtained  as 


7 

6 


(36) 


E  {Xl’)  =  1-2-/B  (n.^Dz)  +  7’B  ((z^/)z)“) 

00  1  00  CO  pi/  2  _2  'V 

=  1  -  2'Y  V  — —  +  V  V  ^ 

(n7r)2(7n7r)2 

00  o  00  1  1 


7  7^ 

=  1  _  I  + 

3  20 


+  7'  F  y 

^  ,  (n7r)2  (m7r)2 

n^mm=l  \  /  V  / 


(37) 


where  we  have  used  E{z^)  =  3  and  —  7r‘*/90  [15].  The  first  two  moments 

of  X,  Xe  and  Xl  as  given  by  Equations  (33)-(37)  are  shown  in  Figure  5  as  functions 
of  7.  It  can  be  seen  from  the  figure  that  the  moments  of  Xi  agree  with  those  of 
the  actual  random  variable  A"  only  for  very  small  7  while  the  moments  of  A^  are 
very  close  to  the  actual  moments  for  all  7.  Therefore  we  expect  the  exponential 
approximation  to  be  an  accurate  one. 


4  Density  of  Xe 

The  moments  of  Xe  as  given  by  (33)  provides  complete  information  about  its  statis¬ 
tical  behavior.  In  principle,  one  could  compute  the  moment  generating  function  of 
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Xe  from  the  moments  {l^{n)}  as 


n=0  '*'• 

and  find  the  density  function  as  the  inverse-Laplace  transform.  However,  since  the 
series  above  is  slowly  converging,  especially  for  large  s,  this  is  not  an  computationally 
attractive  approach.  Instead,  we  use  an  indirect  approach  which  is  based  on  orthog¬ 
onal  polynomials  for  finding  the  density.  We  first  prove  the  folloAving  about  the  value 
of  the  density  function  at  the  origin. 

Lemma  2:  Let  ^(a:)  denote  the  probability  density  function  of  Xe-  Then  ^(0) 
has  the  following  property: 

0  if  7  <  7r^/2 
oo  if  7  >  7r^/2  . 

Proof  :  We  prove  the  first  part  of  the  lemma  by  using  ChernofF  bound  [16].  We 
have 

^(0)  =  lim  -  Pr  (0  <  Xe  <  e)  . 

From  the  definition  of  Xe  the  probability  above  is 


Pr{0  <  Xe  <e)  =  Pr  (  z^T>z  >  --Inc 
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We  now  bound  the  right  hand  side  by  using  the  ChernofF  bound  on  z^Dz  via  (33)  to 
obtain 


Pr  (0  <  Xe  <  e)  < 


sin(V^)  / 


1/2 


for  0  <  s  <  7r^/2  . 


Dividing  both  sides  by  e  and  taking  the  limit  we  obtain 


q{0)  <  Urn 


VTs 

sin(\/^)  / 


for  0  <  5  <  71^/2  . 


J 


The  limit  above  is  0  if  s/7  >  1,  which  is  possible  for  some  s  in  the  allowed  range  if 
7  <  71^/2.  Since  q{x)  >  0,  g(0)  =  0. 

For  the  second  part  of  the  proof,  we  use  the  fact  that  each  of  the  random  variables 
in  the  series  of  z^Dz  are  nonnegative  to  obtain  the  following  lower  bound. 


Pr  (0  <  Xe  <  e)  =  Pr 


>  Pr 


> 
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The  rightmost  probability  is  2(5(  y  — tt^Ih  e/7)  where  Q{x)  is  the  complementary  dis¬ 
tribution  function  for  the  unit  normal  random  variable.  Proceeding  as  before  we 
obtain 

5(0)  >  Um  ■ 

Finally  we  use  Q{x)  ~  as  s  00  [17]  to  obtain 


?(0)> 


27 

hm  — . 

v  —  In  e 


The  limit  above  is  infinite  if  7  >  1 2‘y,  this  completes  the  proof.  □ 

The  lemma  above  indicates  that  the  density  of  Xe  changes  its  form  as  7  exceeds  a 
critical  value,  7r^/2.  The  left  tail  of  the  density,  i.e.  q{x)  for  small  x,  strongly  affects 
the  probability  of  error,  since  Pg  is  the  integral  of  the  product  of  q{x)  (or  convolution 
of  with  itself)  and  some  rapidly  decaying  function  such  as  Therefore 

the  behavior  of  g(x)  for  x  1  is  of  special  interest  for  accurate  calculation  of  error 
probability.  The  following  lemma  indicates  that  this  behavior  is  polynomial. 

Lemma  3:  q{x)  —  0  for  x  small,  where  /(x)  =  0{g{x))  is  equivalent  to 

hma,_,o  f{^)l9{^)  =  c  for  a  positive  finite  c. 

Proof  :  From  (33)  we  have 


^  i  (v^^/sinh(7'5^)^  ^  t  >  0 

lo  ^  ~  I  (\/27|i|/sin(^27|<|))  ‘  0  >  i  >  -7rV27 

I  00  t  <  — 7r^/27  . 

Since  x“^  dx  converges  for  /?  <  1  and  diverges  for  /3  >  1  for  any  5  >  0,  we  must 
have 

1 

x~^q{x)  —  c(x)— 

X 

for  some  function  c(x)  with  finite  and  positive  c(0),  where  we  have  assumed  that  q{x) 
is  continuous  at  x  =  0.  Then  the  claim  is  proved.  □ 

This  lemma  is  in  agreement  with  the  previous  lemma  in  predicting  the  value  of 
g(0).  Moreover,  it  suggests  that  for  accurate  calculation  of  ^(x)  for  small  x  it  is  better 
to  calculate  the  tilted  density  g(x)/x^"’^.  Before  we  describe  a  method  to  perform 
this  calculation,  we  exploit  a  useful  property  of  A"£;(7)  that  will  enable  us  to  compute 
the  density  function  for  a  single  value  of  7,  say  7  =  1,  and  to  obtain  all  other  density 
functions  from  this  computed  one.  From  the  definition  of  A’’£:(7)  we  have 


A^;(7) 


1/7  _ 
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Then  for  any  positive  71  and  72,  the  associated  Xe's  satisfy 


which  specifies  a  one-to-one  mapping  from  Xf;(7i)  to  X£:(72)  via 

g{x)  =  . 


Then  the  densities  of  the  two  random  variables  are  related  via 


[18] 


972  (®) 


971(9  M«^)) 

I9'  (9"H®))t 


where  we  introduced  a  subscript  to  the  density  function  to  emphasize  its  dependence 
on  7.  Upon  manipulation,  we  obtain 


972(®)  =  ^97i(®“)  (39) 

where  a  =  71/72. 

The  calculation  oi  q{x)  for  a  given  7,  as  will  be  seen  shortly,  is  a  computationally 
intensive  effort.  Therefore  the  simple  relation  (39)  introduces  a  big  saving  in  the 
required  numerical  work. 

The  moments  of  a  random  variable  are  the  projections  of  its  density  function  on 
a  sequence  of  polynomials.  Therefore,  the  density  can  be  reconstructed  from  these 
moments  via  a  complete  orthonormal  polynomial  basis.  The  even  (or  odd)  indexed 
Legendre  polynomials  constitute  such  a  basis  on  the  interval  (0, 1)  [19],  Any  piecewise 
continuous  function  f{x)  on  (0, 1)  that  has  left  and  right  derivatives  at  every  point 
in  the  interval  can  be  expressed  as 

00 

f{x)  =  E  (2^  +  ^)FnPn{x)  (40) 

n=0 

n  even 


where  Pn{x)  is  the  n’th  Legendre  polynomial  and  Fn  is  given  by 


Speciahzing  to  the  case  of 


where  we  have  defined 


/(’T)  = 


9(3^) 

x'^ 


A  ^ 

27 


(41) 


17 


we  obtain 


Fr,  =  E  [Xi'‘F„(X£.)]  .  (42) 

To  obtain  the  coefficients  in  terms  of  the  moments,  we  first  expand  Pn{x)  in  its 
Taylor  series  as  [19] 

n 

Prt{x)  =  “nfe®*’  (43) 

fc=0 
k  even 

where  the  coefficients  a„fe  are  given  by 


«nfe  = 


2nA  ’ 


n- 


^{n  -  k)l2, 


(44) 


Now  we  can  compute  the  expectation  in  (42)  in  terms  of  the  moments  given  by  (33) 
as  , 

n 

Fn=  Yl  ^nkK^  -  h)  (45) 

fc=0 
k  even 

and,  consequently,  the  density  function  as 


q{x) 


x’^  ^  (2n,  +  l)Pn(a:)  ^ 


nssO 

n  even 


lesO 
k  even 


(46) 


The  convergence  of  the  series  in  (46)  is  quite  fast.  Note  that  the  moments  always 
remain  finite  since  k  —  h  >  —Tr^/2'y  for  all  fc  >  0.  It  is  also  worth  mentioning  that 
although  the  formula  above  remains  valid  for  any  value  of  h,  in  particular  for  h  =  0 
which  corresponds  to  the  direct  computation  of  q{x)  without  tilting,  the  value  that 
we  use  is  the  most  appropriate  one  because  it  captures  the  polynomial  behavior  of 
q{x)  in  the  small  x  region. 

The  results  of  the  computation  is  shown  in  Figure  6  for  various  7  values.  As 
expected  the  density  is  more  concentrated  around  unity  for  small  7.  As  7  increases, 
the  density  gets  flat  until  7  =  7r^/2  and  becomes  peaked  at  a;  =  0  thereafter.  The 
density  function  is  observed  to  have  a  sharp  decay  around  its  a;  =  1  tail.  In  Appendix 
C,  we  show  that  the  nature  of  this  decay  is  exponential  in  7/(1  —  x),  provided  that 
q{x)  is  bounded  and  analytic.  (Boundedness  holds  for  7  <  7r^/2  and  analyticity  is 
assumed  to  hold.) 

The  density  function  of  Xe  can  also  be  calculated  with  another  method.  The 
distribution  function  of  the  random  variable  z^Dz  is  given  in  [3],  from  which  the 
distribution  function  of  Xe  can  be  directly  computed,  the  density  function  is  then 
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obtained  by  differentiation.  It  was  observed  that  this  method  yields  results  which  are 
in  perfect  agreement  with  Figure  6. 

In  the  next  section,  we  wiU  use  the  density  we  have  obtained  to  evaluate  the 
performance  of  the  receivers  under  consideration. 

5  Results  and  Discussion 

The  error  probability  for  the  single  sample  receiver  is  given  by  (11)  as 

Pe  =  \  j  dx  . 

Z  Jo 

We  have  computed  the  error  probability  using  this  integral.  The  result  of  this  com¬ 
putation  is  shown  in  Figure  7.  It  is  seen  from  this  figure  that  the  error  probability 
is  above  10“^  for  7  >  2  and  ^  <  22  dB.  The  single  sample  receiver  can  not  combat 
the  phase  noise  effectively  for  large  7,  and  the  bit  error  rate  severely  degrades  with 
increasing  phase  noise  strength. 

The  error  probability  calculation  is  more  involved  for  the  multisample  receivers. 
For  the  single-filter  receiver,  the  error  probability  is  given  by  (12)  which  can  be  written 
as 

1 

-Pe  =  2  exp(-^a:/2M)g.y/M(®)  dx  .  (47) 

Using  (39)  with  71  =  7  and  72  =  'y/M,  we  can  rewrite  (47)  as 

=  l  r  exp(-(x/2M)Mx^^-^qJx^)  dx 
2  Jo 

which  with  the  change  of  variable  y  =  x^  yields 

P.  =  \jl  exp(-{y>/"/2M),,(y)  dy  .  (48) 

We  want  to  find  the  value  of  M  that  minimizes  (48).  Since  no  closed  form  for  g(-) 
is  available,  this  optimization  involves  evaluating  this  equation  for  M  =  1,2,3,... 
until  the  optimum  is  reached®.  Equation  (48)  has  the  desired  form  since  the  density 
function  has  the  same  parameter  7  for  all  M. 

The  result  of  the  computation  of  the  bit  error  rate  via  (48)  is  shown  in  Figure  8. 
The  plotted  bit  error  rate  values  have  been  optimized  over  M.  The  optimal  value 

®Iii  fact,  we  could  search  for  optimal  non-integer  M  for  the  single-filter  receiver.  This  does  not 
seem  to  be  necessary  since  the  observed  transitions  with  integer  values  were  smooth. 
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of  M  increases  with  SNR  and  the  phase  noise  strength.  This  is  because  as  the  SNR 
increases  the  receiver  prefers  to  use  a  smaller  portion  of  the  signal  energy  to  reduce 
the  effect  of  the  phase  noise  with  the  same  factor. 

Finally  we  focus  on  the  performance  of  the  double  filter  multisampling  receiver.  In 
Section  2.2,  we  showed  that  the  bit  error  rate  for  this  receiver  can  be  computed  by  first 
finding  the  ikf-fold  convolution  of  the  density  function  of  X{'^  jM)  and  then  removing 
the  conditioning  on  r  in  Equation  (19).  However,  the  density  function, 
changes  with  M,  thus  this  method  is  computationally  prohibitive.  Therefore,  we  use 
the  alternative  approach  which  resulted  in  Equations  (25)  and  (26).  One  can  further 
reduce  the  complexity  by  expressing  a{k)  in  terms  of  q-y{x)  as  follows: 

a{k)  =  f  exp(— ^.T/2ikf)q.Y/Af(®) 

J  0 

=  r  //^exp(-|j/'/^/2M)q^(y)dy  .  (49) 

where  we  have  used  the  same  transformation  used  in  obtaining  (48). 

Equation  (25)  in  conjunction  with  (49)  provides  a  method  for  the  error  probability 
computation.  In  performing  the  numerical  computation  the  symmetry  between  the 
terms  in  the  inner  summation  was  exploited  as  outlined  in  Section  2.2.  The  resulting 
computation  is  a  very  efficient  one  in  comparison  with  the  direct  convolutions.  The 
results  are  shown  in  Figure  9.  The  bit  error  rates  are  again  optimized  with  respect  to 
M  as  in  the  single  filter  case.  A  comparison  of  the  performance  curves  for  the  single 
sample  receiver  and  multisample  receiver  with  and  without  double  filtering  reveals 
that  the  increased  receiver  complexity  by  oversampling  the  signal  and  additional 
lowpass  filtering  may  improve  the  error  performance  to  a  significant  extent  when  the 
signal-to-noise  ratio  is  high  and  the  phase  noise  is  strong.  This  can  be  seen  from 
Figure  10  which  shows  the  performance  of  the  three  receivers  for  7  =  1  and  7  =  4. 
For  low  SNR  values  (up  to  12  dB  for  7  =  1  and  9  dB  for  7  =  4),  the  best  receiver 
is  the  simplest  one,  i.e.  the  optimal  value  for  the  multisampling  receivers  is  M  =  1. 
Only  when  the  SNR  is  further  increased,  an  increase  in  the  receiver  complexity  is 
warranted.  At  a  bit  error  rate  of  10~®  with  7  =  1,  the  double  filter  receiver  has  about 
3.5  dB  gain  over  multisample  single- filter  receiver  and  about  6  dB  gain  over  the  single 
sample  receiver. 

The  error  probability  for  the  double  filter  receiver  is  very  close  to  the  lower  bound 
obtained  in  Section  2.2,  as  illustrated  by  Figure  11  for  7  =  1  and  7  =  4.  Therefore 


20 


the  simple  lower  bound  expression 


Pe  U 


4M 

7 


^(1  _ 

7 


(50) 


in  conjunction  with  (19)  provides  a  simple  and  reliable  approximation  to  the  proba¬ 
bility  of  error.  With  the  use  of  this  closed-form  result,  the  performance  of  a  system 
with  a  given  SNR  and  phase  noise  strength  can  be  found  by  a  straightforward  opti¬ 
mization  over  M.  It  should  be  noted  that  the  value  of  M  that  optimizes  (50)  is,  in 
general,  lower  than  the  exact  optimal  value.  This  is  due  to  the  fact  that  the  variation 
of  r  from  its  mean  is  neglected  in  the  Jensen’s  bound  that  results  in  (50). 

Since  the  phase  noisy  SNR  r  in  (20)  is  the  average  of  M  i.i.d.  random  variables, 
a  Gaussian  approximation  for  large  SNR  and  phase  noise  strength  is  also  plausible. 
In  fact,  approximating  the  density  of  r  with  a  Gaussian  density  on  the  interval  (0,^) 
yields  an  error  probability  which  is  very  close  to  that  shown  in  Figure  9.  However, 
a  closed-form  result  that  is  obtained  by  extending  the  range  to  the  real  line  yields  a 
poor  estimate  due  to  the  large  negative  tail  of  Pe(^)- 

The  performance  of  the  double-filter  receiver  as  predicted  by  Figure  9  and  approx¬ 
imated  by  Equation  (50)  is  very  close  to  that  predicted  by  Foschini  and  his  coworkers 
in  [3].  This  is  because  the  truncated  density  tail  of  the  linear  approximation  to  the 
phase- noisy  envelope  is  extremely  low  for  a  phase  noise  strength  of  7/M,  when  M 
is  around  the  optimal  value.  This  is  in  contrast  with  the  single-filter  case  where  the 
agreement  of  our  results  with  [3]  is  not  as  strong  due  to  smaller  values  for  the  optimal 
value  of  M. 

The  signal-to-noise  ratio  needed  to  remain  below  a  certain  predetermined  bit  error 
rate  increases  with  the  introduction  of  phase  noise.  The  amount  of  increase  in  the  SNR 
is  referred  to  as  phase  noise  penalty.  In  Figure  12  the  penalties  for  the  three  receivers 
are  shown  as  a  function  of  7  for  a  bit  error  rate  of  10~®.  While  the  difference  in  the 
penalties  for  the  single  sample  receiver  and  the  multisample  double-filter  receiver  is 
less  than  1  dB  for  7  <  0.5,  the  latter  receiver  outperforms  the  former  at  higher  phase 
noise  strengths.  This  is  in  support  of  increased  receiver  complexity  that  is  tailored 
towards  combating  the  phase  noise  especially  when  7  >  1. 

Advancing  optical  technology  promises  fiber-optic  networks  with  data  rates  in 
excess  of  100  Mbps  per  user  and  semiconductor  lasers  with  linewidths  of  5  MHz  or 
less.  With  these  parameters,  the  phase  noise  strength  7  is  in  the  range  of  0.6  for 
which  the  penalty  is  about  1  dB  with  the  double  filter  receiver  and  2  dB  with  the 
single  sample  receiver.  This  means  not  only  that  the  phase  noise  will  not  be  a  very 
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significant  impairment  on  the  bit  error  rate,  but  also  that  simple  IF  receivers  will  be 
sufficient  for  most  future  applications. 

6  Extension  to  N-FSK 

The  results  of  the  previous  sections  about  binary  FSK  can  be  extended  to  FSK  with 
N  hypotheses  (N-FSK)  without  much  difficulty.  The  receiver  structure  is  the  same 
as  Figure  2  with  the  number  of  branches  increased  from  2  to  N.  We  present  an 
analysis  to  the  performance  of  N-FSK  below.  It  must  be  emphasized  that  we  hold 
the  transmission  rate  of  data  bits  and  the  energy  per  data  bit  constant  while  making 
comparisons  among  different  iV-ary  FSK  schemes,  as  in  [20]. 

Let  the  possible  hypotheses  be  0, 1, . . . ,  iV  —  1.  Let  the  normalized  filter  outputs  be 
Vi,  i  =  0, 1, . . . ,  —  1,  and  assume  a  “0”  is  transmitted.  Then  Vi  will  be  independent 

random  variables  with  the  density  functions  found  from  Section  2  as 

p(vo|r) 

p{vi) 

Let  Pn{'I')  denote  the  probability  of  a  symbol  error  conditioned  on  the  phase-noisy 
SNR  r.  Since  a  symbol  error  will  occur  when  any  of  the  Vf  for  f  0  is  greater  than 
Vo,  we  have 


=  f  — ]  e  (\/4ruo)  no  >  0 


r  J 


(M-  1)! 


,-Vi 


Vi  >  0  i  =  1, 2, . . . ,  iV  —  1  . 


(51) 

(62) 


FA,(r)  =  l-Pt(l/„>max{V'i,...,Vjv_,}|r) 

=  1  -  y  [Pr(Vi  <  Vo)]^“^p(voii')<lwo 

where  we  have  used  the  fact  that  Vi  for  i  >1  are  i.i.d.  conditioned  on  r.  Then  with 
the  use  of  (52),  we  obtain 


Pjv(r) 


/ 


M~\  k 
fe=o 


,iV-l 


kl 


p{vo\r)dvo 


which  does  not  seem  to  be  easily  brought  to  a  closed- form  similar  to  (19).  Instead 
of  computing  Pjv(r)  numerically  we  will  find  upper  and  lower  bounds  in  closed-form. 
We  first  use  1  —  (1  —  y)^~^  <  {N  —  l)y  for  0  <  y  <  1,  to  obtain 

r 

PN{r)  <{N-1)  Y.  7T"~"'’P(^o|r’)  dvo  =  {N  -  IJP^ir)  (53) 

fc=o 
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where  P2{r)  is  the  conditional  error  probability  for  binary  FSK  given  by  (19),  The 
upper  bound  in  (53)  is  exactly  the  union  bound.  To  obtain  a  lower  bound  to  Piv(T’) 
we  use 

1  -  (1  -  S,)"-'  >  (JV  -  1);,  - 


and  we  get 

Pu{r)  >{N-  l)F:(r)  - 
where  the  function  5'm(^)  is  defined  as 


9M{r)  =  j  "“j  p{vo\r)dvo 


To  obtain  5m(^)  in  closed-form  we  first  use 


/M-l  ,,k 


M—\  M—1  n+m 


E  E  E 


.  fe=o 


=  e 


and  we  get 


n=0  m=0 

2(M-1) 

-2„  Y, 

1=0 


2{M-1) 


n\ml 


—1) 


li  S 


fe=0 


Mj 


guir)  =  /  e  -l3ivl^p{vo\r)dvo 

/=o 


where  we  have  defined 


min(i,M— 1) 


P, 


E 

k=0 


.kj 


(64) 


(55) 


We  proceed  exactly  as  in  Appendix  A  to  evaluate  the  integral  in  (54)  to  obtain  the 
result  as 

g-2r/3  2(^1)  2(^1) 

I  /-n 


guir)  = 


E 


^  n=0  i=n 


(56) 


which  is  quite  similar  to  (19). 

We  finally  have  the  bounds  to  the  conditional  error  probability  as 


(iV-l)P2(r)-  ^^9M{r)<PN{r)<{N  ~1)P2{t)  (57) 

in  conjunction  with  Equations  (19)  and  (56).  Note  that  the  two  bounds  are  equal  for 
N  =  2.  The  bounds  are  tight  for  small  N  and  large  r.  For  A”  <  16  and  r  >  14  dB, 
the  two  bounds  are  within  an  order  of  magnitude  for  any  M. 
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The  symbol  error  probability  Pjv  is  obtained  as  the  expected  value  of  PN{r)  opti¬ 
mized  over  M,  i.e. 

Pjv  =  min  Er  [PNir)]  . 

M 

Since  only  bounds  to  Pn{‘>’)  are  available  in  closed-form,  we  will  perform  the  opti¬ 
mization  for  the  upper  bound  and  use  the  optimal  value  of  M,  say  M*,  in  the  lower 
bound.  The  result  is 

{N  -  1}P2  -  2^)  <Pn<{N-  1)P2  (58) 


where  P2  is  the  binary  FSK  error  probability  found  in  the  previous  section.  The  bit 
error  probability  Pb  is  related  to  the  symbol  error  probability  as  [12] 


N/2 

N-1 


Pn 


which  results  in 

Ep,  _  [s„.(r)|  <  n  <  Ep^  .  (59) 

The  upper  bound  to  the  actual  bit  error  rate  reveals  that  for  N-FSK  at  a  bit  SNR 
value  of  ^  and  a  phase  noise  of  7  is  at  most  iV/2  times  the  bit  error  rate  of  binary 
FSK  at  an  SNR  of  ^  logj  N  and  a  phase  noise  strength  of  7log2  N.  The  expectation  in 
the  lower  bound  can  be  computed  without  any  convolutions,  using  the  same  method 
described  in  the  previous  section.  Figure  13  shows  the  upper  and  lower  bounds  to 
Pb  for  iV  =  4,8, 16  as  well  as  the  binary  bit  error  rate  for  7  =  1.  It  is  seen  that  the 
bounds  are  very  close,  and  that  the  bit  error  rate  improves  with  increasing  N.  At 
a  bit  error  rate  of  10”^  the  improvement  over  binary  FSK  is  2.5  dB  with  A  =  4, 
4  dB  with  N  =  8  and  4.8  dB  with  N  =  16.  Since  the  cost  of  the  receiver  and  the 
bandwidth  occupation  are  proportional  to  iV,  4-FSK  may  be  a  desirable  alternative 
to  binary  FSK  for  this  value  of  phase-noise  strength. 


7  Conclusion 

We  have  analyzed  the  performance  of  a  binary  FSK  coherent  optical  system  with  an 
accurate  treatment  of  the  phase  noise.  We  obtained  closed-form  expressions  for  the 
error  probability  conditioned  on  a  normalized  phase-noisy  amplitude.  These  results 
can  be  used  in  conjunction  with  other  approximations  or  solutions  to  the  phase- 
noisy  amphtude,  e.g.  the  one  given  in  [7].  In  this  work,  we  used  an  exponential 
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approximation  to  the  square  of  the  amplitude  which  was  found  to  be  accurate.  We 
obtained  the  density  of  this  approximation  from  its  moments,  which  are  available  in 
closed-form,  via  a  Legendre  series  method.  As  a  result,  we  were  able  to  obtain  the 
performance  of  the  receivers  proposed  in  [3]  with,  we  believe,  more  accuracy  than  in 
previous  work.  We  also  provided  a  simple  and  tight  lower  bound  in  closed-form.  Our 
results  show  that  the  simple  correlator  receiver  is  adequate  only  for  a  certain  range  of 
signal-to-noise  ratio  and  phase  noise  values,  beyond  which  receivers  which  track  the 
phase  noise  by  oversampling  improve  considerably  upon  its  performance. 

We  also  extended  the  results  to  the  case  of  N-FSK  and  obtained  tight  upper  and 
lower  bounds  to  the  bit  error  rate.  As  a  result,  we  observed  that  increasing  the  signal 
dimensionality  may  be  desirable  up  to  a  certain  limit  to  be  robust  against  phase  noise. 


Appendix 


A  Derivation  of  Conditional  Error  Probability  for 
Double— Filter  Receiver 


In  this  appendix,  we  derive  (19)  which  gives  the  error  probability  of  the  multisample 
double-filter  receiver  condition  on  phase-noisy  signal-to-noise  ratio  r.  From  (18)  we 
have 


A/-1 


1  ,, 


dx 


V  ) 


M-l 


(Af_l)/2  ^  },l 
k~0 


i: 


(A.l) 


where  we  have  defined 


>lfc(r)  =  (v/^)  dx  . 

With  the  change  of  variable  y  —  ^/x  we  obtain 

Afc(r)  2^  (\/4rt/)  dy  . 
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The  integral  above  can  be  found  from  11.4.28  of  [15]  in  conjunction  with  /„(a:)  = 
as 


Ak{r) 


^(M-i)/2  r(fc  +  M) 


F{k 


2k+M  r(M) 
where  F  is  the  confluent  hypergeoinetric  function, 
with  (A.l)  yields 


+  M,M,r/2)  (A.2) 

Equation  (A.2)  in  conjunction 


M-l 


Tt  /_ 


(A.3) 


Now  we  use  the  Kummer  transformation  (13.1.27)  and  the  power  series  for  a  confluent 
hypergeoinetric  function  (13.1.2)  from  [15]  to  simplify  (A.3)  via 


F{k  +  M,M,rl2)  =  e’-/='F(-fc,M,-r/2) 

.W2^(-fc)n(-r’/2)- 
^0  ”• 


(A.4) 


where  (a)„  =  a{a  +  1)  •  •  •  (a  +  n  -  1).  Since  (-fe)„  =  0  for  n  >  fe  the  series  above  is 
a  finite  series.  Furthermore,  since  (  — fe)n  =  (  — l)"fc!/(A:  —  n)!  for  n  <  k  and  (M)„  = 
(M  +  n  —  1)!/(M  —  1)!,  (A.4)  becomes 


F(jfe  +  M,M,r/2) 
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n=0 


fei(M  -  1)!  (r/2)" 
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Inserting  (A.5)  into  (A.3),  using  T{k)  —  {k  -  1)!  and  canceling  the  common  terms, 
we  obtain 


g_r/2  i^l  I  fc 

Pei^)  =  Y 


(fe  +  M-1)!  (r/2)" 


fc=0 


2*’  {k  —  n)\{M  —  1  +  7i)! 
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(A.6) 


which  upon  interchange  of  summations  yields  (19) 


B  Convexity  of  Conditional  Error  Probability 


In  this  appendix  we  prove  that  the  error  probability  conditioned  on  phase-noisy  SNR, 
Pe(r),  is  a  convex  function  of  r.  From  (19),  we  have 
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where 
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k  +  M  -  1 
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Since  the  convexity  of  a  function  is  not  affected  by  multiplication  by  a  positive  con¬ 
stant  and  a  scaling  of  its  argument,  it  suffices  to  show  the  convexity  of  the  function 

h{r)  =  e~''t{r) 


where  t{r)  is  the  polynomial  of  degree  M  —  1  with  coefficients  Cn{M  —  l)/n!.  The 
second  derivative  of  h{r)  is 

h"{r)  —  6“’’  [t{r)  —  2t'{r)  -|-  t”(T’)] 

where  “prime”s  denote  derivatives.  Thus,  Pe{r)  is*  convex  if  and  only  if  the  polynomial 

p(r)  =  i(r)  -  2t'{r)  +  t"{r)  (B.3) 

of  degree  M  —  1  is  nonnegative  for  all  r  >  0.  The  latter  will  be  satisfied  if  all  the 
derivatives  of  p(r)  at  r  =  0  are  nonnegative,  by  Taylor’s  theorem.  The  n’th  derivative 
of  p(r)  at  r  =  0  is  given  as 

p(")(0)  =  -f 

=  c„(M-l)-2c„+i(M-l)-t-c„+2(M-l) 


Let  N  =  M  —  1  and  d„(iV)  =  c„(i\f)  —  Cn+i{N)  for  n  =  0, 1, . . . ,  iV.  Then  we  have 

pW(0)  =  d„(iV)-d„+i(iV).  (B.4) 


Thus,  we  need  to  show  that  the  coefficients  d„{iV’)  are  nonincreasing  in  n  for  every 
N.  To  do  this,  we  need  an  intermediate  result  given  in  the  following  lemma. 
Lemma  B.l  The  binomial  coefficients  satisfy 


(B.5) 


for  J  =  0, 1, 2, . . .  and  j  =  0, 1, . . . ,  J. 

Proof  :  We  will  use  induction  on  J.  For  J  =  0,  the  claim  is  obviously  true.  For 
J  =  1,  the  claim  is  easily  verified  for  both  j  =  0  and  j  =  1.  Now  let’s  assume  that 
the  claim  is  true  for  J  and  for  all  j  <  J.  We  will  show  that  this  implies  that  the 
claim  is  also  true  for  J  +  1  and  for  all  j  <  J  -f  1.  Let  a.j(  J)  denote  the  left  hand  side 
of  (B.5).  For  0  <  J  <  J,  we  have 


+  1) 


27 


which  is  the  desired  result.  (The  second  equality  above  follows  by  a  change  in  the 
index  of  summation,  the  third  one  follows  by  induction  hypothesis,  the  fourth  one  is 
a  simple  rearrangement  of  terms  and  the  last  one  follows  by  the  well-known  property 
of  the  Pascal  triangle.)  Finally  ao(J  -(- 1)  =  1  trivially  satisfies  (B.5).  Thus  the  claim 
is  true  for  all  j  <  J  -f  1.  This  completes  the  proof  by  induction.  □ 

A  corollary  to  Lemma  B.l  provides  a  closed-form  to  d„{iV'). 

Corollary  The  difference  coefficients  d„(iV)  satisfy 


f2N  +  1\ 
\N-n) 


n  =  0, 1, . . . ,  iV. 


Proof  :  From  (B.2)  we  have 


2'^c„(iV) 


(B.6) 


where  we  have  used  a  change  of  index  I  =  N  —  k.  Now  we  use  Lemma  B.l  with 
J  =  2N  and  j  =  N  —  n  to  get 


which  results  in 


d^N)  =  c„(7V)-c„+i{iV) 

2^\N  -n) 

as  claimed.  □ 
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As  a  result  of  this  corollary,  we  see  that  for  a  fixed  N,  dn{N)  are  decreasing  with 
71  for  0  <  n  <  N.  This  means  that  the  polynomial  p{r)  is  nonnegative  for  r  >  0, 
and  consequently  that  the  conditional  error  probability  Pe{r)  is  convex  for  any  M,  as 
explained  before. 

An  implication  of  the  convexity  of  Pe{r)  is  that  the  conditional  error  probability 
can  not  be  improved  by  time-sharing  between  two  phase- noisy  SNRs  and  7*2,  since 
Pe(Ari  -f-  (1  -  A)r2)  <  APe(ri)  -b  (1  -  X)Pe{r2)  for  aU  0  <  A  <  1. 

C  Tail  Behavior  of  q{x) 

In  this  appendix  we  concentrate  on  the  decay  properties  of  the  density  function  q{x) 
around  its  x  =  1  tail.  We  prove  that  this  decay  is  exponential  as  described  in  the 
following  property. 

Property:  The  density  function  q{x)  has  an  exponential  decay  around  x  =  1, 
characterized  as 

lim  (1  —  x)lng(x)  =  — 7/8  .  (C.l) 

aj— ♦!“ 

To  prove  this  property  we  need  two  lemmas.  We  first  define  a  transformation 
which  is  closely  related  to  Mellin  transform  [21],^ 

Definition:  The  moment  transform  of  a  function  /(x)  defined  on  (0, 1)  is  given 
by  ^ 

F{t)=  f  x7(x)dx  (C.2) 

Jo 

where  t  >  0. 

It  can  be  observed  that  for  any  bounded,  continuous  function  the  moment  trans¬ 
form  exists,  and  that  it  vanishes  a.s  t  —y  co.  The  behavior  of  a  function  in  its  x  =  1 
tail  is  related  to  the  asymptotic  behavior  of  its  moment  transform.  The  following 
lemma  formalizes  this  notion. 

Lemma  C.l  Let  /  and  g  be  two  bounded,  analytic  functions  on  (0, 1)  whose 
moment  transforms  satisfy  F{t)  <  G(t)  for  all  t  >  to-  Then  there  exists  e  >  0  such 
that  /(x)  <  g(x)  for  aU  x  >  1  —  e. 

Proof  :  Let  h  =  g  —  f.  h  is  bounded  and  analytic  with  moment  transform 
H  =  G  —  F.  Since  h  is  analytic,  there  is  a  neighborhood  of  1  in  which  h  does  not 
change  sign,  say  (1  —  1)  for  some  ^  >  0.  Let 

A  ~  max{|/?(x)|  :  x  G  (0,1)} 

B  —  [  h{x)  dx  . 

Ji-s 
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Suppose  h{x)  <  0  on  (1  —  6,1).  Then  the  following  inequalities  hold, 

f  x^h{x)  dx  <  f 
Jo  Jo 

f  x^h{x)  dx  <  (1  —  SyB  , 

Jl-6 


i-s  (I  _ 

x*'\h{x)\  dx  <  - - — 

r  “r  i 


which  implies 


-4(1  - 

t  +  1 


+  B 


(C.3) 


However,  since  A  >  0  and  B  <  0,t  can  be  made  large  enough  to  make  the  right  hand 
side  of  (C.3)  negative.  This  is  a  contradiction,  since  H{t)  >  0  for  all  t  >  to.  The 
proof  is  complete  with  the  choice  of  an  e  with  0  <  e  <  6.  □ 

The  following  lemma  establishes  a  relation  between  the  moment  transform  of  a 
function  and  the  Laplace  transform  of  its  x  =  1  tail. 

Lemma  C,2  Let  /  be  a  bounded  function  on  (0, 1)  and  let  /*  be  defined  as 


^  I  /(I  -  0  <  X  <  e 

1 0  otherwise 

for  0  <  €  <  1.  If  Qe{t)  is  the  Laplace  transform  of  /e(a:)  and  F{t)  is  the  moment 
transform  of  /(.t),  then  Q«(t)  F{t)  for  large  t  and  small  e. 

Proof  :  Qf{t)  is  defined  as 

Qe{t)  =  f  /(I  -  .'c)e"‘®  dx  . 

Jo 


With  small  e  we  use  e  ~  (1  —  x)‘  for  0  <  x  <  c,  to  obtain 


(5e(t)  f  /(I  -  x)(l  -  x)*  dx 
Jo 


Since  x‘  becomes  smaller  in  the  interval  (0, 1  —  e)  with  increasing  t,  the  rightmost  inte¬ 
gral  will  be  arbitrarily  close  to  F{t)  for  large  t.  Therefore,  the  quantity  |<56(t)  —  F{t)\ 
can  be  made  arbitrarily  close  to  0  for  sufficiently  small  e  and  large  t.  □ 

With  these  two  lemmas  at  hand  we  now  proceed  with  the  proof  of  the  property 
stated  at  the  beginning  of  this  appendix. 

Proof  of  Property  :  Lemma  C.2  states  that  the  tail  of  the  density  function  ^(x) 
around  x  =  1  can  be  found  as  the  inverse  Laplace  transform  of  its  moment  transform 
for  large  arguments.  The  moment  transform,  of  q[x)  is  given  by  (33)  and  can 
be  written  as 


1/2 

g-V-2^/2  _ 
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For  large  the  first  factor  is  bounded  between  1  and  v/f.  Thus,  for  large  t  we  have 

g-^/2  <  <  ^g-v^/2  _ 

Then  from  the  two  lemmas  we  proved,  the  function  g(l  —  x)  for  small  x  will  be 
bounded  by  the  two  functions  whose  Laplace  transforms  yield  the  bounds  in  (C.4). 
With  the  use  of  Formula  29.3.82  of  [16]  we  obtain 

/_JL_e-T'/8y  <  ^(1  _  j,)  <  /_!_  -  l) 

for  small  y,  which  upon  substitution  of  x  for  1  —  y  becomes 

<  ,(x)  <  -  l)  (0.6) 

for  X  close  to  1.  We  observe  that  the  dominant  tail  behavior  of  q{x)  is  exponential  in 
(—7/8(1  —  x)).  Equivalently,  by  taking  logarithms  of  all  sides  in  (C.5),  we  obtain 

lim  (1  —  x)ln  5(x)  =  — ^  (C.6) 

*—♦1—  o 

where  we  have  used  lima._*o®ln®  =  0.  This  completes  the  proof.  □ 
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List  of  Figures 


Figure  1:  The  optical  heterodyning  receiver. 

Figure  2:  The  IF  receiver,  a)  Front  end  {M  —  1  for  the  single  sample  receiver), 
b)  Processing  of  samples  for  single  sample  and  single-filter  receivers,  c)  Processing 
of  samples  for  the  multisample  double-filter  receiver. 

Figure  3:  Error  probability  of  the  double-filter  receiver  {Pe{r))  conditioned  on  the 
phase- noisy  signal-to-noise  ratio  r. 

Figure  4;  Lower  bound  on  the  error  probability  obtained  by  Jensen’s  inequality. 

Figure  5:  Comparison  of  statistics  of  the  phase-noisy  squared  envelope  and  its 
linear  and  exponential  approximations,  a)  Means,  b)  Second  moments. 

Figure  6:  Probability  density  function,  q{x),  of  the  exponential  approximation  Xe 
for  various  7  values;  a)  7  =  0, 0.5, 1, 2, 3, 4  b)  7  =  4, 6, 16. 

Figure  7:  Error  probability  of  single  sample  receiver  as  a  function  of  signal-to-noise 
ratio. 

Figure  8;  Error  probability  of  multisample  single-filter  receiver  as  a  function  of 
signal-to-noise  ratio  (optimal  values  of  M  are  shown  on  the  data  points). 

Figure  9:  Error  probability  of  multisample  double-filter  receiver  as  a  function  of 
signal-to-noise  ratio. 

Figure  10:  Comparison  of  the  bit  error  performances  of  the  three  receivers  under 
consideration,  a)  7  =  1,  b)  7  =  4. 

Figure  11:  A  comparison  of  the  exact  and  approximate  bit  error  probabilities  for 
7  =  1  and  7  =  4.  The  respective  optimal  values  for  M  are  also  shown. 

Figure  12:  Penalty  in  signal-to-noise  ratio  with  respect  to  the  case  with  no  phase 
noise  at  a  bit  error  rate  of  10~®. 

Figure  13:  Upper  and  lower  bounds  to  the  bit  error  probability  for  N-FSK  with 
7  =  1  and  iV  =  2,4,8,16. 
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